Transient Range Versioning Based on Redirection 
DESCRIPTION 



BACKGROUND OF THE INVENTION 

[Para 1] The present invention relates generally to the field of versioning. 
More specifically, the present invention is related to range versioning based on 
redirection. 

Discussion of Prior Art 

[Para 2] A common problem in database systems relates to how it handles 
scenarios wherein multiple users try to access the same document. For 
example, when user A opens a document A for updating at time t=l , should 
the database render the committed or uncommitted document to user B at 
time t=2. To address this problem, database servers support various isolation 
levels. 

[Para 3] Figure 1 illustrates some examples of common isolation levels in 
present day database systems. The "repeatable read" (RR) isolation level is the 
most restrictive isolation level wherein locks are placed on all data needed by a 
query, thereby preventing others from updating. On the opposite side of the 
spectrum is the least restrictive "uncommitted read" (UR) isolation level. The 
UR isolation level is commonly referred to as a "dirty read," wherein users are 
able to see changes prior to the commitment of such changes. 

[Para 4] Additional isolation levels exist in the middle of the spectrum. For 
example, a "read stability" (RS) isolation level provides for no change in rows 
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which have been read until the logical unit of work (LUW) ends. Another such 
example of an isolation level falling in the middle of the spectrum is the 
"cursor stability" (CS) isolation level, wherein the row to which the cursor 
points cannot be changed. 

[Para 5] Another way database systems address the problem of multiple 
users trying to access the same data is via the use of versioning. Figure 2 
illustrates a simple example of versioning. At time t=1 , both useri and userz 
are able to access document A. When useri is ready to modify/update 
document A, the database system makes a copy of document A in storage, and 
at time t=2, useri works off the copy in storage as a versioned document, 
while userz is still served document A from its previous location. 

[Para 6] Figure 3a illustrates extends the example of figure 2 to depict 
versioning in page memory. In this example, at time t=1 , both useri and user2 
are able to access document A which is stored on Page 1 of memory. When 
useri is ready to modify/update document A, the database system makes a 
copy of document A into Page 2 of memory, and at time t=2, useri works off 
the copy in Page 2 as a versioned document, while userz is still served 
document A from Page 1 . 

[Para 7] Both the examples of Figure 2 and 3a are examples of versioning 
without redirection. Figure 3b illustrates versioning in page memory systems 
with redirection. In this example, at time t=1 , both useri and userz are able to 
access document A which is stored on Page 1 of memory. When usen is ready 
to modify/update document A, the database system makes a copy of 
document A into Page 2 of memory, and at time t=2, useri works off the copy 
in Page 1 as a versioned document, while userz is redirected to Page 2. 
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[Para 8] The following references provide general background in the area of 
versioning, but none of them teach the present invention's transient range 
versioning based on redirection. 

[Para 9] The patent to Evans et al. (5,805,899) provides for a method and 
apparatus for internal versioning of objects using a mapfile. The Evans et al. 
method discloses the management of versioned objects that comprise a 
software program using a mapfile. It provides methods for executing and 
linking objects of a program by determining the appropriate versions of each 
object to use in execution. However, it should be noted that Evans et al. fail to 
teach the grouping of objects into ranges. Furthermore, Evans et al. fail to 
disclose versioning ranges of objects and storing such versioned ranges in 
either transient or permanent storage. 

[Para 1 0] The patent to Antoshenkov (6,439,783) provides for a range-based 
query optimizer for use with a database system having a set of records. The 
disclosed method scans database records by maintaining interval and range 
information to be able to skip scanning records that do not satisfy queries and 
scanning only those that do. It should be noted that the Antoshenkov 
reference fails to teach versioning ranges of objects and storing such 
versioned ranges in either transient or permanent storage. 

[Para 1 1] The patent to Peloquin et al. (6,449,705) provides for a method and 
apparatus for improving performance of drive linking through the use of hash 
tables. The disclosed method routes requests to volumes by assigning drive 
volumes into zones and using a zone table. Peloquin et al., however, fail to 
teach ranges of objects, versioning ranges of objects, or storing versioned 
ranges of objects in either transient or permanent storage. 
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[Para 1 2] The patent to Odom et al. (6,51 6,320) provides for tiered Piasliing 
for data access. Odom et al. disclose a method for building a tiered index 
structure that includes building a first-level index structure having a 
predetermined number of entries, building a second-level index structure 
having a dynamic number of entries, and establishing a link between an entry 
in the first-level index structure and an entry in the second-level index 
structure. Odom et al., however, fail to teach versioning in a nodal structure. 

[Para 1 3] The patent to Kawahito et al. (6,665,864) provides for a Method and 
apparatus for generating code for array range check and method and 
apparatus for versioning. Kawahito et al., however, fail to teach versioning in a 
nodal structure. 

[Para 14] Whatever the precise merits, features, and advantages of the above 
cited references, none of them achieves or fulfills the purposes of the present 
invention. 

SUMMARY OF THE INVENTION 

[Para 1 5] The present invention provides for a method for transient versioning 
in a storage architecture that manages node ranges, wherein the method 
comprises of: (a) receiving a node modification request (a node deletion 
request, a node insertion request, a node modification request, etc.) from a 
database system; (b) copying a node range where said modification is to be 
made to a transient storage; (c) labeling the copied node range with an 
identifier (e.g., timestamp or LSN); and wherein said labeled node range is 
locatable via said identifier and a hash on said node range. 

[Para 1 6] The present invention also provides for a method for transient 
versioning in a storage architecture that manages node ranges via a node id 
range index, wherein each node is assigned a node id value and a set of nodes 
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form a node range. Each entry in the node id range index points to a node 
range and its range identifier, RID. The method comprises the steps of: (a) 
receiving a node modification request for a range; (b) shadowing nodes in said 
range to a Version IHash Table based on RID; (c) assigning a time identifier 
(e.g., timestamp or LSN) to copies of said range; wherein a node in said 
shadowed range is locatable via said time identifier and RIDs. New readers, 
after a modification, access current nodes through a new RID and old readers 
access old nodes via the same RID, with the shadowed copy being locatable in 
said Version Hash Table by hashing the same RID. In one embodiment, when 
modifications cause nodes in a range to be moved to a new RID, previous 
readers are redirected from the new RID to the old RID via a Redirection Hash 
Table. In another embodiment, when modifications cause nodes in a range to 
be moved to a new RID, previous readers are redirected from the new RID to 
the old RID via an index that describes where old versions are in said Version 
Hash Table. In another embodiment, for range deletions, the range being 
deleted is moved to reserved RID ridFF. 

[Para 1 7] The present invention also provides computer medium carrying 
computer readable program code implementing the above-mentioned 
methods. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[Para 18] Figure 1 illustrates some examples of common isolation levels in 
present day database systems. 

[Para 19] Figure 2 illustrates a simple example of versioning. 

[Para 20] Figure 3a illustrates extends the example of figure 2 to depict 
versioning in page memory. 

[Para 21] Figure 3b illustrates versioning in page memory systems with 
redirection. 
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[Para 22] Figure 4 illustrates transient range versioning based on hash 
redirection. 

[Para 23] Figure 5 illustrates range deletions wherein the range being deleted 
is moved to a reserved RID ridFF. 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 

[Para 24] While this invention is illustrated and described in a preferred 
embodiment, the invention may be produced in many different configurations. 
There is depicted in the drawings, and will herein be described in detail, a 
preferred embodiment of the invention, with the understanding that the 
present disclosure is to be considered as an exemplification of the principles 
of the invention and the associated functional specifications for its 
construction and is not intended to limit the invention to the embodiment 
illustrated. Those skilled in the art will envision many other possible variations 
within the scope of the present invention. 

[Para 25] The present invention provides a mechanism for transient 
versioning in architectures that manage node ranges. In architectures that 
manage node ranges, each node is assigned a node ID value, and a set of 
nodes form a range of node IDs called a node range. To find a node among a 
set of node ranges, an index is used. This index, called node ID range index, 
points to the set of node ranges. Each entry in the index describes one range 
and points to where the range is located. Individual nodes are located by 
finding the correct range in the index. 

[Para 26] When nodes are added to or deleted from a node range, the range 
of nodes can be versioned. The range of nodes can be versioned by copying 
the nodes before changes, to transient storage, and then the original nodes 
are modified. Different versions can be tracked by assigning timestamps to 
each copy of the node range. For example, at time 1 , the range 1 to 1 0 is 
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located in Rl , if at time 2, we insert 8 into 1 to 1 0 in Rl , tPie old version of the 
range 1 to 1 0, without 8 at time 1 , needs to be copied to transient storage, 
labeled that it is from time 1 , and locatable through a hash on Rl . Then old 
readers (looking for 1 to 1 0 at time 1 ) would use the index, check the time 
stamp of R1 which is now at time 2, and hash on R1 to get the old version of 
the range 1 to 1 0 at time 1 . 

[Para 27] Since old versions are found through a hash on the current location, 
a problem exists when node ranges split, merge, move, or gets deleted, 
because the node range index is no longer correct as far as old readers are 
concerned. That is, if the range 1 to 10 in Rl were to split into 1 to 4 in R2 
and 5 to 1 0 in Rl , old readers looking for 1 to 4 hashes R2, but the old version 
1 to 1 0 is actually copied and hashed on Rl . This problem is solved by 
redirecting the old reader to hash on the previous location to find the old 
version. 

[Para 28] Each entry in the node ID range index points to the location of the 
nodes in a range. The location is called the range Identifier or RID. Before 
changes are made In a range, the nodes in a range are copied (shadowed) to a 
Version Hash Table based on the RID. Every copy of the range including the 
current one is assigned a timestamp or LSN. New readers after a change, 
access the current nodes through RID, while old readers access the old nodes 
through the same RID, but hashing it to find the shadowed copy in the Version 
Hash Table. This works as long as long as old and new ranges end up with the 
same RID in the index. However, if changes causes nodes in the range to be 
moved to a new RID, previous readers need to be redirected from the new RID 
to the old RID. These readers will be redirected from the new RID to the old 
RID via a Redirection Hash Table. Instead of a Redirection Hash Table, an 
index that describes where old versions are in the Version Hash Table can also 
be used. 
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[Para 29] Figure 4 illustrates transient range versioning based on hash 
redirection. In figure 4, for example, at LSN20, the range 1 -4 is moved from 
RIDl to RID2 (4.1). The Version Hash Table shadows the range 1-10 in RIDl 
before the change at LSN 1 0 (4.2). Old readers from LSN 11 to 1 9 looking for 
nodes 1 to 4 consult the node range index and get RID2. These readers need 
to be redirected from RID2 to RIDl via the Redirection Hash Table, to find the 
matching Version Hash Table entry for RIDl (4.3). 

[Para 30] For any reader, with ReadLSN, if there are Redirection Hash entries 
with LSN> ReadLSN, but no matching range, no redirection is required to hash 
the Version Hash Table. If there are Version Hash entries with matching 
ranges, but LSN>ReadLSN, then the Redirection Hash needs to be used again 
for redirection. 

[Para 31] At time 20, 

[Para 32] Updater moves range 1-4 from RIDl to RID2 (4.1). This involves 
deleting 1 -4 from RIDl and inserting 1 -4 to RID2. Prior to deletion, 1 -1 0 is 
shadowed using the last pageLSN and hashing RIDl (4.2). Because 1-4 is 
moved to RID2 from RIDl , hash entry for RID2 is created for all readers prior 
to the move, looking for 1-4 to be redirected to RIDl (4.3). 

[Para 33] At time 40, 

[Para 34] Updater moves range 1 -3 from RID3 to RID2 with 4-4 (6.1 ). This 
involves deleting 1 -3 from RID3 and merging 1 -3 to RID2 with 4-4. Prior to 
movement 1 -3 is shadowed (6.2) and 4-4 is shadowed (6.3). Because 1 -3 will 
move to RID2 from RID3, hash entry for RID2 is created to redirect all readers 
prior to move, looking for 1 -3, to hash RID3 (6.4). 



[Para 35] At time 30, 
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[Para 36] Reader LSN 22 looks for 3, finds RID3 pageLSN> readLSN so hashes 
RID3 on Redirection Hash, finds first entry with LSN> readLSN containing 3 in 
(5.3). Hashes RID2 on Version Hash, finds last entry < LSN 22 containing 3 in 

(5.2) . 

[Para 37] At time 30, 

[Para 38] Reader LSN 1 8 looks for 3, finds RID3 pageLSN > readLSN so hashes 
RID3 on Redirection Hash, finds first entry with LSN > readLSN containing 3 in 

(5.3) . Hashes RID2 on Version Hash, looks for entry < readLSN containing 3, 
not found since first entry already has pageLSN > readLSN so hashes RID2 on 
Redirection Hash, finds first entry with LSN > readLSN containing 3 in (4.3). 
Hashes RID1 on Version Hash, finds last entry < LSN 1 8 containing 3 in (4.2). 

[Para 39] At time 40, 

[Para 40] Reader LSN 32 looks for 4, hashes RID2 on Redirection Hash, no 
entry with 4 and LSN > 32, hashes RID2 on Version Hash Table, finds last entry 

< LSN 32 in (6.3). 

[Para 41] At time 40, 

[Para 42] Reader LSN 22 looks for 4, hashes RID2 on Redirection Hash, no 
entry with 4 and LSN > 22, hashes RID2 on Version Hash Table, finds last entry 

< LSN 22 in (5.2). 

[Para 43] At time 50, 

[Para 44] Reader LSN 1 1 looks for 4, hashes RIDl on Redirection Hash, finds 
first entry with 4 and LSN > 1 1 in (7.4). Hash RID2 on Version Hash, no entry 
with LSN < 1 1 , hashes RID2 on Redirection Hash, finds entry with LSN > 1 1 



Page 9 of 25 



containing 4 in (4.3). HasPies RIDl on Version Hasli, finds last entry witPi LSN 
< 1 1 containing 4 in (4.2). 

[Para 45] Figure 5 Illustrates transient range versloning based hash 
redirection, wherein Range Deletions can be treated as If the range being 
deleted is moved to reserved RID ridFF. The reader should hash the 
Redirection Hash Table on ridFF to find the correct Version Hash Table entry. 

[Para 46] At time 40, 

[Para 47] Updater deletes range 4-4 in RID2 (6.1). Because 4-4 is deleted, it 
is shadowed in (6.2). 1 -3 is shadowed on the assumption that It needs to be 
updated to know that 4-4 Is deleted (6.3). A Redirection Hash entry Is added 
for ridFF In (6.4) for the deleted range 4-4. 

[Para 48] Reader LSN 32 looks for 4, hashes on the Redirection Hash for ridFF 
in (6.4). Hashes RID to find the Version Hash entry In (6.2). 

[Para 49] Hence, the present Invention's versloning approach favors new and 
recent readers as opposed to old readers, as old versions of data are copied 
out (for example, to transient storage), and old readers have to be redirected 
to them. Therefore, old readers are penalized, but new and recent readers are 
favored as they are least likely to be redirected. 

[Para 50] The present Invention's approach also helps clustering, as old 
versions of data are copied out to temporary storage and remaining space is 
used to store new data. The latest data Is always kept In or near the same 
place where the old data was. 
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[Para 51] Additionally, the present invention provides for an article of 
manufacture comprising computer readable program code contained within 
implementing one or more modules for transient versioning based in 
redirection. Furthermore, the present invention includes a computer program 
code-based product, which is a storage medium having program code stored 
therein which can be used to instruct a computer to perform any of the 
methods associated with the present invention. The computer storage 
medium includes any of, but is not limited to, the following: CD-ROM, DVD, 
magnetic tape, optical disc, hard drive, floppy disk, ferroelectric memory, flash 
memory, ferromagnetic memory, optical storage, charge coupled devices, 
magnetic or optical cards, smart cards, EEPROM, EPROM, RAM, ROM, DRAM, 
SRAM, SDRAM, or any other appropriate static or dynamic memory or data 
storage devices. 

[Para 52] Implemented in computer program code based products are 
software modules for: (a) receiving a node modification request for a range; 
(b) shadowing nodes in said range to a Version Hash Table based on RID; (c) 
assigning a time identifier to copies of said range; wherein a node in said 
shadowed range is locatable via said time identifier and RIDs. 



CONCLUSION 

[Para 53] A system and method has been shown in the above embodiments 
for the effective implementation of transient range versioning based on 
redirection. While various preferred embodiments have been shown and 
described, it will be understood that there is no intent to limit the invention by 
such disclosure, but rather, it is intended to cover all modifications falling 
within the spirit and scope of the invention, as defined in the appended claims. 
For example, the present invention should not be limited by 
software/program, computing environment, or specific computing hardware. 
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[Para 54] The above enhancements are implemented in various computing 
environments. For example, the present invention may be implemented on a 
conventional IBM PC or equivalent, multi-nodal system (e.g., LAN) or 
networking system (e.g., Internet, WWW, wireless web). All programming and 
data related thereto are stored in computer memory, static or dynamic, and 
may be retrieved by the user in any of: conventional computer storage, display 
(i.e., CRT) and/or hardcopy (i.e., printed) formats. The programming of the 
present invention may be implemented by one of skill in the art of database 
programming. 
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